Basic Statistics

Raw Counts

Name Value
Rows 1,530,447
Columns 29
Discrete columns 22
Continuous columns 7
All missing columns 0
Missing observations 518,759
Complete Rows 1,162,010
Total observations 44,382,963
Memory allocation 810.7 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (by frequency)

## 14 columns ignored with more than 50 categories.
## open_dt: 1406692 categories
## target_dt: 823703 categories
## closed_dt: 1369155 categories
## closure_reason: 1010447 categories
## case_title: 15679 categories
## reason: 61 categories
## type: 216 categories
## queue: 205 categories
## submittedphoto: 380100 categories
## closedphoto: 221128 categories
## location: 137778 categories
## ward: 57 categories
## precinct: 256 categories
## location_street_name: 134814 categories

QQ Plot

Correlation Analysis

## 17 features with more than 20 categories ignored!
## open_dt: 1081625 categories
## target_dt: 621482 categories
## closed_dt: 1053596 categories
## closure_reason: 768720 categories
## case_title: 10973 categories
## reason: 54 categories
## type: 205 categories
## queue: 193 categories
## submittedphoto: 268163 categories
## closedphoto: 174293 categories
## location: 124408 categories
## pwd_district: 22 categories
## police_district: 24 categories
## neighborhood: 24 categories
## ward: 45 categories
## precinct: 256 categories
## location_street_name: 121586 categories

Principal Component Analysis

## 13 features with more than 50 categories ignored!
## open_dt: 1081625 categories
## target_dt: 621482 categories
## closed_dt: 1053596 categories
## closure_reason: 768720 categories
## case_title: 10973 categories
## reason: 54 categories
## type: 205 categories
## queue: 193 categories
## submittedphoto: 268163 categories
## closedphoto: 174293 categories
## location: 124408 categories
## precinct: 256 categories
## location_street_name: 121586 categories